CDS

Accession Number TCMCG078C04110
gbkey CDS
Protein Id KAG0453281.1
Location complement(join(20834369..20834608,20834714..20834840,20835096..20835212,20835473..20835629,20835709..20835850,20835927..20836017,20836188..20836294,20837637..20837792,20840651..20840821,20840923..20841064,20841295..20841414,20841494..20841613,20841688..20841738,20843199..20843320,20843419..20843506,20843594..20843616))
Organism Vanilla planifolia
locus_tag HPP92_025945
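
The Location field above uses GenBank feature-location syntax: `join(...)` lists the exon intervals (1-based, inclusive) and `complement(...)` places the feature on the reverse strand. As a minimal sketch, the intervals can be extracted and their spliced length compared with the protein length given in this record (657 aa plus one stop codon). The helper names `parse_location` and `spliced_length` are hypothetical and not part of any library, and the regex handles only plain `start..end` intervals like the ones in this record.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(20834369..20834608,20834714..20834840,20835096..20835212,"
    "20835473..20835629,20835709..20835850,20835927..20836017,20836188..20836294,"
    "20837637..20837792,20840651..20840821,20840923..20841064,20841295..20841414,"
    "20841494..20841613,20841688..20841738,20843199..20843320,20843419..20843506,"
    "20843594..20843616))"
)

def parse_location(location):
    """Return (strand, segments) for a simple GenBank-style location.

    Hypothetical helper: only plain `start..end` intervals are supported,
    not fuzzy bounds such as `<123..456`.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def spliced_length(segments):
    """Total length in nucleotides; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)

strand, segments = parse_location(LOCATION)
print(strand)                     # -
print(len(segments))              # 16
print(spliced_length(segments))   # 1974
# 657 codons for the protein plus one stop codon
print(spliced_length(segments) == 3 * (657 + 1))  # True
```
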

Protein

Length 657 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000014.1
Definition hypothetical protein HPP92_025945 [Vanilla planifolia]
Locus_tag HPP92_025945

EGGNOG-MAPPER Annotation

COG_category S
Description Rhamnogalacturonate lyase
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko01000
KEGG_ko ko:K18195
EC 4.2.2.23
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGGTTCACTCAGGAAACTAAGGAAGGCCATAGGAAAGTCCCTGTCCAGCACTTCTGTTGCTGTTCCAATGGCACCAAAGGGTGTGTGCTTATCGGTCTATGAATGTTATGTAGTGATAGACAATGGCATCCTAGAGCTCACATTGTCGAAGCCTGGGGGGATTGTTACAGGAATCAAATACAATGGCATCGATAATTTAATGGAGATACGTAACAAGGAAGATAATAGAGGGTATTGGGATCTAGTTTGGAGTGAACCTGGAAGTGCTGGTGTTTTTGACGTGGTTAAAGGGACAGATTTTGAAGTAATACTTGAAGATGAAAGCCAAGTAGAAGTTTCATTCATAAGGAGTTGGGATCTATCCCTAAAAGGCTCACGTGTGCCCTTGAAAATTGACAAAAGGTTTATTGTACTTCATGGTATGTCAGGGTTTTATAGTTATGCCATCTATGAGCATTTGGAAGGATGGCCTGATTTCAATATGGCAGAAACCCGAGTTGCTTTCAAGCTTAGGAAAGATAAGTTTCATTACATGGCTATGGCTGATGATAAACAGAGGATTATGCCAATGCCAGATGATCGGTTGCCAGGAAGATGTCAACAATTAGCATATCCTGAAGCAGTTTTTCTGAAAAATTCAATCAATCCTAACCTTATTGGAGAGGTGGATGATAAATATCAGTACTCATGTGATAATAAAGATAACAAGGTTCACGGCTGGATATCGCTAGAACCCCTTATTGGCTTTTGGCAAATTACTCCTAGTGATGAATTCCGGACTGGTGGACCTACAAAACAGAACCTCACCTCTCATGTTGGTCCCACTACACTAGCTGTTTTCGTCAGTGGTCACTACTCTGGTGATGCACTTGTTCCAAAATTCAGGAATGGTGAATACTGGAAGAAGGTCTTTGGCCCTGTTTTCATTTACCTTAATTCCAGTTTAGGAGAGACGGATCCGCGTGTTCTCTGGGAGGATGCGTATTTGCAGATGAAAACTGAGGTGGATTCCTGGCCATATGTGTTCCCTCTTTCAGAAGATTACCACAAGGCAAACCAAAGAGGCTCTGTTACTGGTAGACTTCTCGTGCGTGATAGGTACATTGATGATAATGATCTATATGCAAGTTCGGCGTACATTGGATTGGCCTTACCAGGAGAAGTGGGTTCATGGCAAAGAGAATGCAAAGGCTATCAATTTTGGACAAGGGCTGAGGTTAATGGGTCTTTCTCCATTAACAATGTTTTGACTGGAGAATACAATCTTTATGCTTGGGTTCCTGGTTTCATTGGTGATTACATGTATGGATTAACTATAACTGTAATGGCAGGAAATAGTATTGATCTTGGCGACCTCATATACCAACCACCAAGAGATGGCCCAACTTTATGGGAAATTGGATACCCAGACCGTTCTTCTGCAGAGTTCTATGTTCCAGAACCTAACCCCAACCATGTAAACAAGCTTTATCTCAATCATCCTGAGAGATATAGGCAATATGGACTTTGGGAGAGATATGCAGATCTTTATCCAGAAGGTGATCTTGTTTACACCATTGGCCTTAGTGACTGCAAGAAGGATTGGTTCTTTGCTCATGTTACCAGGAAAAATGAACAAAATTCGTATTCCCCGACAACTTGGCAGATCAGATTCAAACTCAACTGTATCGATCAAGCTGGAATATACATGCTTCGTGTTGCGATTGCATCTGCAACGCTCTCTGAACTGCAAGTTCGATTCAACGACATCAAAGCTAGTCCTGCTCACTTCTCAACTGGCCTCATCGGTAGAGACAACTCGATCGCTAGGCATGGAATTCATGGCCTATATTGGTTATACAACATTGTGGTTCAAACCAGGTGGCTTCTTGAGGGAGAAAATACTATATTTCTCACACAGGCAAGAAGTGCAAGTCCTTTCCAAGGAATAATGTATGACTACATTCGTCTTGAAGGGCCATTGAACTCTTGA
Protein:  
MGSLRKLRKAIGKSLSSTSVAVPMAPKGVCLSVYECYVVIDNGILELTLSKPGGIVTGIKYNGIDNLMEIRNKEDNRGYWDLVWSEPGSAGVFDVVKGTDFEVILEDESQVEVSFIRSWDLSLKGSRVPLKIDKRFIVLHGMSGFYSYAIYEHLEGWPDFNMAETRVAFKLRKDKFHYMAMADDKQRIMPMPDDRLPGRCQQLAYPEAVFLKNSINPNLIGEVDDKYQYSCDNKDNKVHGWISLEPLIGFWQITPSDEFRTGGPTKQNLTSHVGPTTLAVFVSGHYSGDALVPKFRNGEYWKKVFGPVFIYLNSSLGETDPRVLWEDAYLQMKTEVDSWPYVFPLSEDYHKANQRGSVTGRLLVRDRYIDDNDLYASSAYIGLALPGEVGSWQRECKGYQFWTRAEVNGSFSINNVLTGEYNLYAWVPGFIGDYMYGLTITVMAGNSIDLGDLIYQPPRDGPTLWEIGYPDRSSAEFYVPEPNPNHVNKLYLNHPERYRQYGLWERYADLYPEGDLVYTIGLSDCKKDWFFAHVTRKNEQNSYSPTTWQIRFKLNCIDQAGIYMLRVAIASATLSELQVRFNDIKASPAHFSTGLIGRDNSIARHGIHGLYWLYNIVVQTRWLLEGENTIFLTQARSASPFQGIMYDYIRLEGPLNS
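
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); the record does not state a translation table, so that choice is an assumption, and `translate` is a hypothetical helper rather than a library function. It is shown on the first ten and last four codons of the CDS; the full sequence can be passed in the same way.

```python
# Standard genetic code (NCBI table 1), with bases ordered T, C, A, G.
# Assumption: this table applies to the CDS in this record.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Raises ValueError if the length is not a multiple of three, and
    KeyError if a codon contains a character other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons of the CDS above.
print(translate("ATGGGTTCACTCAGGAAACTAAGGAAGGCC"))  # MGSLRKLRKA
# Last four codons of the CDS above, ending in the TGA stop codon.
print(translate("TTGAACTCTTGA"))  # LNS
```

Both outputs match the start (MGSLRKLRKA) and end (LNS) of the protein sequence listed in this record.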